Gene identification in novel eukaryotic genomes by self-training algorithm

Similar resources

Gene identification in novel eukaryotic genomes by self-training algorithm

Finding new protein-coding genes is one of the most important goals of eukaryotic genome sequencing projects. However, genomic organization of novel eukaryotic genomes is diverse and ab initio gene finding tools tuned up for previously studied species are rarely suitable for efficacious gene hunting in DNA sequences of a new genome. Gene identification methods based on cDNA and expressed sequen...

Gene prediction in novel fungal genomes using an ab initio algorithm with unsupervised training.

We describe a new ab initio algorithm, GeneMark-ES version 2, that identifies protein-coding genes in fungal genomes. The algorithm does not require a predetermined training set to estimate parameters of the underlying hidden Markov model (HMM). Instead, the anonymous genomic sequence in question is used as an input for iterative unsupervised training. The algorithm extends our previously devel...

Using game theory techniques in self-organizing maps training

The self-organizing map is the most widely used neural network for clustering and vector quantization. Since its introduction, this method has been applied to many problems in a range of fields, and numerous extensions and improvements have been proposed for it. A self-organizing map uses a number of cells to estimate the distribution function of the input patterns in a multidimensional space. The possible presence of dead cells is considered a fundamental problem in the self-organizing map algorithm...

Self-organizing Approach for Automated Gene Identification in Whole Genomes

An approach to identifying protein-coding regions in whole genomes has been proposed, based on evolutionary considerations and on the simple idea of distinguishing the coding phase explicitly. For several genomes, the optimal window length for averaging the GC-content function and calculating codon frequencies has been found. It is shown that the structure of distribution of triplet fre...
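
As a rough illustration of the two quantities this abstract mentions, the sketch below computes GC content over sliding windows and triplet frequencies for a chosen reading phase. The function names, the window length, and the example sequences are assumptions made for illustration only; the abstract is truncated and does not describe the authors' actual procedure.

```python
from collections import Counter


def gc_content_windows(seq, window):
    """Return the GC fraction of every full sliding window of length `window`."""
    seq = seq.upper()
    values = []
    for start in range(len(seq) - window + 1):
        chunk = seq[start:start + window]
        values.append((chunk.count("G") + chunk.count("C")) / window)
    return values


def triplet_frequencies(seq, phase=0):
    """Return relative frequencies of non-overlapping triplets read from offset `phase`."""
    seq = seq.upper()
    # Only complete triplets are counted; a trailing partial triplet is ignored.
    triplets = [seq[i:i + 3] for i in range(phase, len(seq) - 2, 3)]
    counts = Counter(triplets)
    total = sum(counts.values())
    return {t: n / total for t, n in counts.items()} if total else {}


if __name__ == "__main__":
    # Hypothetical toy sequences, not real genomic data.
    print(gc_content_windows("GGCCAATT", 4))
    for phase in range(3):
        print(phase, triplet_frequencies("ATGATGCCC", phase))
```

Comparing the triplet distributions obtained for the three phases is one simple way to see how a coding phase might stand out; how the window length is chosen is left open here.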

Computational Identification and Characterization of Repeats in Sequenced Eukaryotic Genomes

Repetitive sequences, or repeats, are often called “junk DNA” because they do not seem to provide any sequence-specific function in the genome. These sequences are ubiquitous and abundant in all species examined to date. It is generally believed that repeats have a profound impact on genome evolution and genome organization. The recent availability of whole genome sequences has opened a new...

Journal

Journal title: Nucleic Acids Research

Year: 2005

ISSN: 0305-1048,1362-4962

DOI: 10.1093/nar/gki937